Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 5425 |
| Missing cells | 11971 |
| Missing cells (%) | 9.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 974.9 KiB |
| Average record size in memory | 184.0 B |
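The derived figures in the table above can be cross-checked from the raw counts. The sketch below is plain arithmetic on numbers copied from this report; it assumes that 1 KiB is 1024 bytes and that the missing-cell percentage is taken over all cells (variables × observations). The variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 23
n_observations = 5425
missing_cells = 11971
total_size_kib = 974.9

# Per-variable missing counts as listed in the report's missing-value alerts.
missing_by_variable = {
    "tipo_traslado": 4525,
    "nota_acceso": 1578,
    "nota_admision_def": 2725,
    "centro_escolar_acceso": 2461,
    "cod_provincia": 162,
    "provincia": 162,
    "cod_municipio": 179,
    "municipio": 179,
}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing, "
      f"{avg_record_bytes:.1f} B per record")
print("sum of per-variable missing counts:", sum(missing_by_variable.values()))
```

The per-variable counts add up to the reported total of 11971, so the eight flagged variables account for all missing cells.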
Variable types
| CAT | 15 |
|---|---|
| NUM | 8 |
Warnings
| Warning | Type |
|---|---|
| centro_escolar_acceso has a high cardinality: 137 distinct values | High cardinality |
| fecha_nacimiento has a high cardinality: 3690 distinct values | High cardinality |
| municipio has a high cardinality: 435 distinct values | High cardinality |
| tipo_traslado has 4525 (83.4%) missing values | Missing |
| nota_acceso has 1578 (29.1%) missing values | Missing |
| nota_admision_def has 2725 (50.2%) missing values | Missing |
| centro_escolar_acceso has 2461 (45.4%) missing values | Missing |
| cod_provincia has 162 (3.0%) missing values | Missing |
| provincia has 162 (3.0%) missing values | Missing |
| cod_municipio has 179 (3.3%) missing values | Missing |
| municipio has 179 (3.3%) missing values | Missing |
| fecha_nacimiento is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2021-05-14 10:00:05.285061 |
|---|---|
| Analysis finished | 2021-05-14 10:00:13.656632 |
| Duration | 8.37 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
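A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the original invocation: the report does not say how the data was loaded, so the CSV input, the file paths, the report title and the helper name `build_report` are all assumptions. Only `ProfileReport` and `to_file` come from pandas-profiling itself.

```python
def build_report(csv_path, output_html="report.html", config_file=None):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the module loads even without these packages installed.
    import pandas as pd
    from pandas_profiling import ProfileReport  # v2.9.0 per the table above

    # Assumption: the source data is available as a CSV file.
    df = pd.read_csv(csv_path)

    kwargs = {"title": "Dataset profile"}  # illustrative title
    if config_file is not None:
        # Reuse a downloaded config.yaml to keep the same settings.
        kwargs["config_file"] = config_file

    profile = ProfileReport(df, **kwargs)
    profile.to_file(output_html)
    return profile
```

A call such as `build_report("expedientes.csv", config_file="config.yaml")` would then write `report.html`; the CSV name here is a placeholder.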
expediente
Real number (ℝ≥0)
| Distinct | 1718 |
|---|---|
| Distinct (%) | 31.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 597.3996313 |
|---|---|
| Minimum | 0 |
| Maximum | 1851 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 148 |
| median | 458 |
| Q3 | 964 |
| 95-th percentile | 1596 |
| Maximum | 1851 |
| Range | 1851 |
| Interquartile range (IQR) | 816 |
Descriptive statistics
| Standard deviation | 508.7121391 |
|---|---|
| Coefficient of variation (CV) | 0.8515441129 |
| Kurtosis | -0.6828524355 |
| Mean | 597.3996313 |
| Median Absolute Deviation (MAD) | 357 |
| Skewness | 0.6934634878 |
| Sum | 3240893 |
| Variance | 258788.0405 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 9 | 14 | 0.3% | |
| 10 | 13 | 0.2% | |
| 28 | 13 | 0.2% | |
| 18 | 13 | 0.2% | |
| 3 | 13 | 0.2% | |
| 11 | 13 | 0.2% | |
| 6 | 13 | 0.2% | |
| 2 | 13 | 0.2% | |
| 20 | 13 | 0.2% | |
| 16 | 13 | 0.2% | |
| Other values (1708) | 5294 | 97.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 3 | 0.1% | |
| 1 | 10 | 0.2% | |
| 2 | 13 | 0.2% | |
| 3 | 13 | 0.2% | |
| 4 | 13 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1851 | 1 | < 0.1% | |
| 1847 | 1 | < 0.1% | |
| 1846 | 1 | < 0.1% | |
| 1844 | 1 | < 0.1% | |
| 1840 | 1 | < 0.1% |
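Several of the statistics reported for expediente are simple functions of the others, which makes them easy to sanity-check. The sketch below recomputes the mean, coefficient of variation, variance, range and IQR from the values printed in the tables above; it uses only numbers from this report and standard definitions.

```python
import math

# Values copied from the expediente tables.
n = 5425                 # observations (no missing values for this variable)
total = 3240893          # Sum
std = 508.7121391        # Standard deviation
q1, q3 = 148, 964        # Q1 and Q3
minimum, maximum = 0, 1851

mean = total / n               # Mean = Sum / count
cv = std / mean                # Coefficient of variation = std / mean
variance = std ** 2            # Variance = std squared
iqr = q3 - q1                  # Interquartile range
value_range = maximum - minimum

print(f"mean={mean:.4f} cv={cv:.4f} variance={variance:.2f} "
      f"iqr={iqr} range={value_range}")
```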
cod_plan
Real number (ℝ≥0)
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1627.792074 |
|---|---|
| Minimum | 1621 |
| Maximum | 1639 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 1621 |
|---|---|
| 5-th percentile | 1623 |
| Q1 | 1626 |
| median | 1627 |
| Q3 | 1632 |
| 95-th percentile | 1635 |
| Maximum | 1639 |
| Range | 18 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.639889297 |
|---|---|
| Coefficient of variation (CV) | 0.002236089827 |
| Kurtosis | 0.04872817441 |
| Mean | 1627.792074 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.8025982261 |
| Sum | 8830772 |
| Variance | 13.24879409 |
| Monotonicity | Increasing |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1626 | 1412 | 26.0% | |
| 1632 | 935 | 17.2% | |
| 1627 | 740 | 13.6% | |
| 1623 | 617 | 11.4% | |
| 1628 | 568 | 10.5% | |
| 1625 | 272 | 5.0% | |
| 1624 | 247 | 4.6% | |
| 1635 | 156 | 2.9% | |
| 1631 | 132 | 2.4% | |
| 1636 | 102 | 1.9% | |
| Other values (8) | 244 | 4.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1621 | 17 | 0.3% | |
| 1622 | 10 | 0.2% | |
| 1623 | 617 | 11.4% | |
| 1624 | 247 | 4.6% | |
| 1625 | 272 | 5.0% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1639 | 52 | 1.0% | |
| 1638 | 29 | 0.5% | |
| 1636 | 102 | 1.9% | |
| 1635 | 156 | 2.9% | |
| 1634 | 66 | 1.2% |
des_plan
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| GRADO EN EDIFICACIÓN | 1412 | 26.0% | |
| GRADO EN INGENIERÍA INFORMÁTICA EN INGENIERÍA DEL SOFTWARE | 935 | 17.2% | |
| GRADO EN INGENIERÍA INFORMÁTICA EN INGENIERÍA DE COMPUTADORES | 740 | 13.6% | |
| GRADO EN INGENIERÍA CIVIL - CONSTRUCCIONES CIVILES | 617 | 11.4% | |
| GRADO EN INGENIERÍA DE SONIDO E IMAGEN EN TELECOMUNICACIÓN | 568 | 10.5% | |
| GRADO EN INGENIERÍA CIVIL - TRANSPORTES Y SERVICIOS URBANOS | 272 | 5.0% | |
| GRADO EN INGENIERÍA CIVIL - HIDROLOGÍA | 247 | 4.6% | |
| MÁSTER UNIVERSITARIO EN INVESTIGACIÓN EN INGENIERIA Y ARQUITECTURA | 161 | 3.0% | |
| MÁSTER UNIVERSITARIO EN INGENIERÍA DE TELECOMUNICACIÓN | 156 | 2.9% | |
| MÁSTER UNIVERSITARIO EN INGENIERÍA INFORMÁTICA | 102 | 1.9% | |
| Other values (6) | 215 | 4.0% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 74 |
|---|---|
| Median length | 58 |
| Mean length | 46.58470046 |
| Min length | 20 |
anio_apertura_expediente
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2010-11 | 1101 | 20.3% | |
| 2011-12 | 652 | 12.0% | |
| 2013-14 | 539 | 9.9% | |
| 2009-10 | 513 | 9.5% | |
| 2012-13 | 495 | 9.1% | |
| 2014-15 | 444 | 8.2% | |
| 2015-16 | 311 | 5.7% | |
| 2016-17 | 277 | 5.1% | |
| 2017-18 | 276 | 5.1% | |
| 2018-19 | 268 | 4.9% | |
| Other values (4) | 549 | 10.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
exp_cerrado
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| N | 3192 | 58.8% | |
| S | 2233 | 41.2% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
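The frequency and distinct-value figures for a categorical variable such as exp_cerrado follow directly from value counts. The sketch below rebuilds them from a synthetic list that reproduces the reported counts (3192 N, 2233 S); the list is a stand-in for the real column, which is not available here.

```python
from collections import Counter

# Synthetic stand-in reproducing the reported counts for exp_cerrado.
values = ["N"] * 3192 + ["S"] * 2233

counts = Counter(values)
n = len(values)

# Frequency table, most common value first.
for value, count in counts.most_common():
    print(f"{value}: {count} ({100 * count / n:.1f}%)")

# Share of distinct values; the report shows this as "< 0.1%".
distinct_pct = 100 * len(counts) / n
print(f"distinct: {len(counts)} ({distinct_pct:.3f}%)")
```

This prints `N: 3192 (58.8%)` and `S: 2233 (41.2%)`, matching the table.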
exp_trasladado
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| N | 4525 | 83.4% | |
| S | 900 | 16.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
tipo_traslado
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 4525 |
| Missing (%) | 83.4% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| I | 488 | 9.0% | |
| S | 219 | 4.0% | |
| E | 193 | 3.6% | |
| (Missing) | 4525 | 83.4% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.668202765 |
| Min length | 1 |
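The length statistics above look odd for a variable whose observed values are all single characters. One explanation that fits the numbers exactly is that the 4525 missing entries were measured as the 3-character string "nan". This is an inference from the figures, not something the report states, and the sketch below only checks that the arithmetic is consistent with it.

```python
from statistics import median

# Reported counts for the one-character codes in tipo_traslado.
present = {"I": 488, "S": 219, "E": 193}
n_missing = 4525

# Assumption: each missing value contributes len("nan") == 3 to the length stats.
lengths = [len(code) for code, count in present.items() for _ in range(count)]
lengths += [len("nan")] * n_missing

mean_length = sum(lengths) / len(lengths)
print(f"mean={mean_length:.9f} median={median(lengths)} "
      f"min={min(lengths)} max={max(lengths)}")
```

Under that assumption the mean comes out at 2.668202765, with median 3, minimum 1 and maximum 3, as in the table.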
exp_bloqueado
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| N | 5334 | 98.3% | |
| S | 91 | 1.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
anio_convocatoria_acceso
Categorical
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2009-10 | 620 | 11.4% | |
| 2008-09 | 539 | 9.9% | |
| 2010-11 | 531 | 9.8% | |
| 2011-12 | 379 | 7.0% | |
| 2012-13 | 307 | 5.7% | |
| 2017-18 | 276 | 5.1% | |
| 2007-08 | 276 | 5.1% | |
| 2013-14 | 272 | 5.0% | |
| 2014-15 | 257 | 4.7% | |
| 2015-16 | 250 | 4.6% | |
| Other values (39) | 1718 | 31.7% |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 0.1% |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
convocatoria_acceso
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| JUN | 3530 | 65.1% | |
| SEP | 1043 | 19.2% | |
| EXT | 288 | 5.3% | |
| FEB | 253 | 4.7% | |
| OCT | 90 | 1.7% | |
| JUL | 83 | 1.5% | |
| FEX | 43 | 0.8% | |
| DIC | 39 | 0.7% | |
| ENE | 26 | 0.5% | |
| NOV | 10 | 0.2% | |
| Other values (7) | 20 | 0.4% |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
acceso
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.367926267 |
|---|---|
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.051944324 |
|---|---|
| Coefficient of variation (CV) | 0.8665575245 |
| Kurtosis | 10.12127567 |
| Mean | 2.367926267 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.117658595 |
| Sum | 12846 |
| Variance | 4.210475511 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 3408 | 62.8% | |
| 5 | 1389 | 25.6% | |
| 3 | 527 | 9.7% | |
| 10 | 38 | 0.7% | |
| 6 | 27 | 0.5% | |
| 4 | 14 | 0.3% | |
| 17 | 8 | 0.1% | |
| 20 | 7 | 0.1% | |
| 2 | 3 | 0.1% | |
| 7 | 2 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 3408 | 62.8% | |
| 2 | 3 | 0.1% | |
| 3 | 527 | 9.7% | |
| 4 | 14 | 0.3% | |
| 5 | 1389 | 25.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 20 | 7 | 0.1% | |
| 17 | 8 | 0.1% | |
| 10 | 38 | 0.7% | |
| 9 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% |
des_acceso
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
| Selectividad | |
|---|---|
| Título Universitario | |
| Formación Profesional | |
| Traslado de Expediente (Estudios Españoles) | 38 |
| Acceso a Segundo Ciclo | 27 |
| Other values (6) | 36 |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Selectividad | 3408 | 62.8% | |
| Título Universitario | 1389 | 25.6% | |
| Formación Profesional | 527 | 9.7% | |
| Traslado de Expediente (Estudios Españoles) | 38 | 0.7% | |
| Acceso a Segundo Ciclo | 27 | 0.5% | |
| Mayores de 25/40/45 años | 14 | 0.3% | |
| Bachillerato Sin Prueba de Acceso | 8 | 0.1% | |
| Título de Bachiller Homologado (Extranjeros) | 7 | 0.1% | |
| COU sin selectividad | 3 | 0.1% | |
| Estudios universitarios extranjeros parcialmente convalidados | 2 | < 0.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 61 |
|---|---|
| Median length | 12 |
| Mean length | 15.32921659 |
| Min length | 12 |
sub_acceso
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.375115207 |
|---|---|
| Minimum | 1 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 11.0081061 |
|---|---|
| Coefficient of variation (CV) | 2.51607228 |
| Kurtosis | 67.54114125 |
| Mean | 4.375115207 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 8.188160732 |
| Sum | 23735 |
| Variance | 121.1783998 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 2032 | 37.5% | |
| 5 | 1760 | 32.4% | |
| 2 | 849 | 15.6% | |
| 6 | 710 | 13.1% | |
| 99 | 70 | 1.3% | |
| 4 | 3 | 0.1% | |
| 3 | 1 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 2032 | 37.5% | |
| 2 | 849 | 15.6% | |
| 3 | 1 | < 0.1% | |
| 4 | 3 | 0.1% | |
| 5 | 1760 | 32.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 99 | 70 | 1.3% | |
| 6 | 710 | 13.1% | |
| 5 | 1760 | 32.4% | |
| 4 | 3 | 0.1% | |
| 3 | 1 | < 0.1% |
des_subacesso
Categorical
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| LOE (Grados) | 1760 | 32.4% | |
| LOGSE | 841 | 15.5% | |
| Curso de Adaptación | 733 | 13.5% | |
| Bachillerato LOMCE | 710 | 13.1% | |
| Titulado universitario | 656 | 12.1% | |
| Ciclos formativos | 445 | 8.2% | |
| . | 102 | 1.9% | |
| Formación Profesional II | 75 | 1.4% | |
| Prueba de Acceso a la Universidad | 35 | 0.6% | |
| COU | 31 | 0.6% | |
| Other values (9) | 37 | 0.7% |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 45 |
|---|---|
| Median length | 12 |
| Mean length | 14.45953917 |
| Min length | 1 |
nota_acceso
Real number (ℝ≥0)
| Distinct | 1573 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 1578 |
| Missing (%) | 29.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.615799324 |
|---|---|
| Minimum | 0 |
| Maximum | 12.272 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.29 |
| Q1 | 5.8255 |
| median | 6.44 |
| Q3 | 7.249 |
| 95-th percentile | 8.66 |
| Maximum | 12.272 |
| Range | 12.272 |
| Interquartile range (IQR) | 1.4235 |
Descriptive statistics
| Standard deviation | 1.056105224 |
|---|---|
| Coefficient of variation (CV) | 0.1596338057 |
| Kurtosis | 1.255282834 |
| Mean | 6.615799324 |
| Median Absolute Deviation (MAD) | 0.678 |
| Skewness | 0.5318499533 |
| Sum | 25450.98 |
| Variance | 1.115358244 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 7 | 28 | 0.5% | |
| 6.5 | 23 | 0.4% | |
| 5.8 | 22 | 0.4% | |
| 5.75 | 21 | 0.4% | |
| 5.9 | 20 | 0.4% | |
| 6 | 18 | 0.3% | |
| 6.25 | 16 | 0.3% | |
| 5.6 | 15 | 0.3% | |
| 6.69 | 14 | 0.3% | |
| 5.65 | 14 | 0.3% | |
| Other values (1563) | 3656 | 67.4% | |
| (Missing) | 1578 | 29.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 2 | < 0.1% | |
| 1.369 | 1 | < 0.1% | |
| 1.559 | 1 | < 0.1% | |
| 1.719 | 1 | < 0.1% | |
| 2.017 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 12.272 | 1 | < 0.1% | |
| 10 | 2 | < 0.1% | |
| 9.9 | 1 | < 0.1% | |
| 9.859 | 1 | < 0.1% | |
| 9.8 | 1 | < 0.1% |
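For a variable with missing values, the percentages in the tables above are not all taken over the same base. The figures for the variable with 1578 missing values suggest that "Distinct (%)" and the mean use only the non-missing observations, while "Missing (%)" and the frequency column use all 5425 rows. The sketch below checks that reading against the reported numbers; it is a consistency check, not a statement of how the tool is implemented.

```python
# Values copied from the tables above (variable with 1578 missing values).
n_rows = 5425
missing = 1578
distinct = 1573
total = 25450.98          # Sum (as printed, so possibly rounded)
other_values_count = 3656  # "Other values (1563)" row

non_missing = n_rows - missing

distinct_pct = 100 * distinct / non_missing   # base: non-missing rows
mean = total / non_missing                    # base: non-missing rows
missing_pct = 100 * missing / n_rows          # base: all rows
other_pct = 100 * other_values_count / n_rows  # base: all rows

print(f"non-missing={non_missing} distinct%={distinct_pct:.1f} "
      f"mean={mean:.4f} missing%={missing_pct:.1f} other%={other_pct:.1f}")
```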
nota_admision_def
Real number (ℝ≥0)
| Distinct | 1615 |
|---|---|
| Distinct (%) | 59.8% |
| Missing | 2725 |
| Missing (%) | 50.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.702148889 |
|---|---|
| Minimum | 5 |
| Maximum | 13.859 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 5.3419 |
| Q1 | 6.1285 |
| median | 7.198 |
| Q3 | 8.91125 |
| 95-th percentile | 11.75615 |
| Maximum | 13.859 |
| Range | 8.859 |
| Interquartile range (IQR) | 2.78275 |
Descriptive statistics
| Standard deviation | 1.987739052 |
|---|---|
| Coefficient of variation (CV) | 0.258075906 |
| Kurtosis | -0.02239247871 |
| Mean | 7.702148889 |
| Median Absolute Deviation (MAD) | 1.268 |
| Skewness | 0.8771500269 |
| Sum | 20795.802 |
| Variance | 3.95110654 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 7 | 11 | 0.2% | |
| 6.54 | 10 | 0.2% | |
| 5.9 | 9 | 0.2% | |
| 6.45 | 8 | 0.1% | |
| 5.6 | 8 | 0.1% | |
| 5.85 | 8 | 0.1% | |
| 7.15 | 8 | 0.1% | |
| 6.23 | 8 | 0.1% | |
| 5.92 | 8 | 0.1% | |
| 5.8 | 8 | 0.1% | |
| Other values (1605) | 2614 | 48.2% | |
| (Missing) | 2725 | 50.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5 | 4 | 0.1% | |
| 5.004 | 1 | < 0.1% | |
| 5.018 | 2 | < 0.1% | |
| 5.02 | 3 | 0.1% | |
| 5.024 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 13.859 | 1 | < 0.1% | |
| 13.8 | 1 | < 0.1% | |
| 13.673 | 1 | < 0.1% | |
| 13.614 | 1 | < 0.1% | |
| 13.593 | 1 | < 0.1% |
centro_escolar_acceso
Categorical
| Distinct | 137 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 2461 |
| Missing (%) | 45.4% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 231-I.E.S. NORBA CAESARINA | 185 | 3.4% | |
| 230-I.E.S. EL BROCENSE | 150 | 2.8% | |
| 220-I.E.S. UNIVERSIDAD LABORAL | 103 | 1.9% | |
| 232-I.E.S. PROFESOR HERNÁNDEZ PACHECO | 79 | 1.5% | |
| 170-I.E.S. SANTA EULALIA | 69 | 1.3% | |
| 203-I.E.S. ALAGÓN | 63 | 1.2% | |
| 240-I.E.S. ÁGORA | 56 | 1.0% | |
| 210-I.E.S. LUIS DE MORALES | 52 | 1.0% | |
| 238-COLEGIO LICENCIADOS REUNIDOS | 50 | 0.9% | |
| 234-COLEGIO SAN ANTONIO DE PADUA | 47 | 0.9% | |
| Other values (127) | 2110 | 38.9% | |
| (Missing) | 2461 | 45.4% |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 0.2% |
Length
| Max length | 40 |
|---|---|
| Median length | 19 |
| Mean length | 16.0764977 |
| Min length | 3 |
sexo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| H | 4311 | 79.5% | |
| D | 1114 | 20.5% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
fecha_nacimiento
Categorical
| Distinct | 3690 |
|---|---|
| Distinct (%) | 68.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1992-06-13 | 9 | 0.2% | |
| 1992-02-04 | 8 | 0.1% | |
| 1990-10-15 | 8 | 0.1% | |
| 1991-07-03 | 7 | 0.1% | |
| 1992-03-30 | 7 | 0.1% | |
| 1992-04-22 | 7 | 0.1% | |
| 1993-10-24 | 7 | 0.1% | |
| 1992-12-18 | 7 | 0.1% | |
| 1994-08-11 | 6 | 0.1% | |
| 1995-06-17 | 6 | 0.1% | |
| Other values (3680) | 5353 | 98.7% |
Unique
| Unique | 2498 |
|---|---|
| Unique (%) | 46.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
cod_provincia
Real number (ℝ≥0)
| Distinct | 50 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 162 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.67832035 |
|---|---|
| Minimum | 0 |
| Maximum | 60 |
| Zeros | 8 |
| Zeros (%) | 0.1% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 6 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 31.9 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 8.64053009 |
|---|---|
| Coefficient of variation (CV) | 0.8091656559 |
| Kurtosis | 9.667099711 |
| Mean | 10.67832035 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.098695934 |
| Sum | 56200 |
| Variance | 74.65876023 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 2401 | 44.3% | |
| 6 | 2172 | 40.0% | |
| 28 | 138 | 2.5% | |
| 45 | 54 | 1.0% | |
| 37 | 51 | 0.9% | |
| 8 | 48 | 0.9% | |
| 11 | 44 | 0.8% | |
| 41 | 41 | 0.8% | |
| 20 | 25 | 0.5% | |
| 21 | 24 | 0.4% | |
| Other values (40) | 265 | 4.9% | |
| (Missing) | 162 | 3.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 8 | 0.1% | |
| 1 | 3 | 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 9 | 0.2% | |
| 4 | 5 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 60 | 14 | 0.3% | |
| 52 | 1 | < 0.1% | |
| 51 | 6 | 0.1% | |
| 50 | 5 | 0.1% | |
| 49 | 6 | 0.1% |
provincia
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 162 |
| Missing (%) | 3.0% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| CÁCERES | 2401 | 44.3% | |
| BADAJOZ | 2172 | 40.0% | |
| MADRID | 138 | 2.5% | |
| TOLEDO | 54 | 1.0% | |
| SALAMANCA | 51 | 0.9% | |
| BARCELONA | 48 | 0.9% | |
| CÁDIZ | 44 | 0.8% | |
| SEVILLA | 41 | 0.8% | |
| GIPUZKOA | 25 | 0.5% | |
| HUELVA | 24 | 0.4% | |
| Other values (40) | 265 | 4.9% | |
| (Missing) | 162 | 3.0% |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.1% |
Length
| Max length | 22 |
|---|---|
| Median length | 7 |
| Mean length | 6.900276498 |
| Min length | 3 |
cod_municipio
Real number (ℝ≥0)
| Distinct | 278 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 179 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 233.7384674 |
|---|---|
| Minimum | 1 |
| Maximum | 912 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 90 |
| Q3 | 450 |
| 95-th percentile | 760 |
| Maximum | 912 |
| Range | 911 |
| Interquartile range (IQR) | 449 |
Descriptive statistics
| Standard deviation | 265.2541775 |
|---|---|
| Coefficient of variation (CV) | 1.13483322 |
| Kurtosis | -0.9772410625 |
| Mean | 233.7384674 |
| Median Absolute Deviation (MAD) | 89 |
| Skewness | 0.6763307704 |
| Sum | 1226192 |
| Variance | 70359.77868 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 2361 | 43.5% | |
| 584 | 308 | 5.7% | |
| 410 | 306 | 5.6% | |
| 215 | 170 | 3.1% | |
| 516 | 125 | 2.3% | |
| 264 | 104 | 1.9% | |
| 760 | 102 | 1.9% | |
| 55 | 62 | 1.1% | |
| 435 | 53 | 1.0% | |
| 785 | 44 | 0.8% | |
| Other values (268) | 1611 | 29.7% | |
| (Missing) | 179 | 3.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 2361 | 43.5% | |
| 5 | 1 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 10 | 15 | 0.3% | |
| 12 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 912 | 3 | 0.1% | |
| 888 | 1 | < 0.1% | |
| 884 | 1 | < 0.1% | |
| 875 | 2 | < 0.1% | |
| 868 | 3 | 0.1% |
municipio
Categorical
| Distinct | 435 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 179 |
| Missing (%) | 3.3% |
| Memory size | 42.4 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| CÁCERES | 1365 | 25.2% | |
| BADAJOZ | 606 | 11.2% | |
| PLASENCIA | 308 | 5.7% | |
| MÉRIDA | 306 | 5.6% | |
| DON BENITO | 170 | 3.1% | |
| NAVALMORAL DE LA MATA | 125 | 2.3% | |
| MADRID | 108 | 2.0% | |
| CORIA | 104 | 1.9% | |
| VILLANUEVA DE LA SERENA | 101 | 1.9% | |
| ALMENDRALEJO | 62 | 1.1% | |
| Other values (425) | 1991 | 36.7% | |
| (Missing) | 179 | 3.3% |
Unique
| Unique | 170 |
|---|---|
| Unique (%) | 3.2% |
Length
| Max length | 26 |
|---|---|
| Median length | 7 |
| Mean length | 9.693271889 |
| Min length | 3 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
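The calculation described above can be written out in a few lines. This is a plain-Python sketch for illustration, not the code the report generator uses; the function name `pearson_r` and the sample data are made up. The normalising factors of the covariance and the standard deviations cancel, so the sketch works directly with sums of deviations.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of products of deviations; the common 1/(n-1) factors cancel out.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)

# Shifting and (positively) rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([10 * a + 3 for a in x], [b - 7 for b in y])
print(r, r_rescaled)
```

Both calls give 0.8 for this toy data, illustrating the invariance under changes in location and scale.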
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
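As a rough illustration of the definition above, the sketch below ranks both variables and then applies the Pearson formula to the ranks. It is plain Python with made-up helper names (`average_ranks`, `spearman_rho`); giving tied values the average of their positions is a common convention, assumed here rather than taken from the report.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    ss_x = sum((a - mx) ** 2 for a in rx)
    ss_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(ss_x * ss_y)

# A monotonic but non-linear relationship (y = x cubed).
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
rho = spearman_rho(x, y)
print(rho)
```

Because y increases whenever x does, the ranks agree exactly and ρ is 1 even though the relationship is not linear.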
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
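The pair-counting definition above corresponds to the variant usually called tau-a, which makes no adjustment for ties. The sketch below implements exactly that definition in plain Python; statistical libraries often default to a tie-adjusted variant instead, so results can differ when the data contains ties. The function name and sample data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables order the pair the same way.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
        # s == 0 means a tie in x or y; such pairs count as neither.
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```

Here 5 of the 6 pairs are concordant and 1 is discordant, so τ = (5 - 1) / 6 ≈ 0.667.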
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | expediente | cod_plan | des_plan | anio_apertura_expediente | exp_cerrado | exp_trasladado | tipo_traslado | exp_bloqueado | anio_convocatoria_acceso | convocatoria_acceso | acceso | des_acceso | sub_acceso | des_subacesso | nota_acceso | nota_admision_def | centro_escolar_acceso | sexo | fecha_nacimiento | cod_provincia | provincia | cod_municipio | municipio |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 2004-05 | SEP | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1981-10-23 | 10.0 | CÁCERES | 1.0 | CÁCERES |
| 1 | 3 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 2000-01 | DIC | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1977-07-09 | 10.0 | CÁCERES | 1.0 | CÁCERES |
| 2 | 5 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 1994-95 | FEB | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | D | 1970-08-26 | 6.0 | BADAJOZ | 345.0 | JEREZ DE LOS CABALLEROS |
| 3 | 6 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | N | N | NaN | N | 2003-04 | JUN | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1978-11-16 | 6.0 | BADAJOZ | 740.0 | VILLAFRANCA DE LOS BARROS |
| 4 | 7 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 1996-97 | OCT | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1971-11-01 | 6.0 | BADAJOZ | 1.0 | BADAJOZ |
| 5 | 8 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 2005-06 | SEP | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1984-04-26 | 10.0 | CÁCERES | 1.0 | CÁCERES |
| 6 | 9 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 2006-07 | DIC | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1984-05-30 | 6.0 | BADAJOZ | 760.0 | VILLANUEVA DE LA SERENA |
| 7 | 11 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | N | N | NaN | N | 2006-07 | DIC | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1979-06-21 | 17.0 | GIRONA | 252.0 | FIGUERES |
| 8 | 12 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 1989-90 | JUN | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1965-12-05 | 6.0 | BADAJOZ | 410.0 | MÉRIDA |
| 9 | 13 | 1621 | MÁSTER EN COMPUTACIÓN GRID Y PARALELISMO | 2007-08 | S | N | NaN | N | 1987-88 | FEB | 6 | Acceso a Segundo Ciclo | 1 | . | NaN | NaN | NaN | H | 1963-12-14 | 60.0 | EXTRANJEROS | NaN | NaN |
Last rows
| | expediente | cod_plan | des_plan | anio_apertura_expediente | exp_cerrado | exp_trasladado | tipo_traslado | exp_bloqueado | anio_convocatoria_acceso | convocatoria_acceso | acceso | des_acceso | sub_acceso | des_subacesso | nota_acceso | nota_admision_def | centro_escolar_acceso | sexo | fecha_nacimiento | cod_provincia | provincia | cod_municipio | municipio |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5415 | 118 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 1992-93 | JUN | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1970-02-18 | 10.0 | CÁCERES | 256.0 | COLLADO DE LA VERA |
| 5416 | 120 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2016-17 | JUL | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1990-09-03 | 3.0 | ALICANTE | 310.0 | DÉNIA |
| 5417 | 121 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2018-19 | SEP | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1991-08-13 | 6.0 | BADAJOZ | 450.0 | NAVALVILLAR DE PELA |
| 5418 | 124 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2019-20 | JUN | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | D | 1994-10-26 | 10.0 | CÁCERES | 444.0 | MADROÑERA |
| 5419 | 125 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2018-19 | JUL | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1993-03-23 | 10.0 | CÁCERES | 772.0 | TRUJILLO |
| 5420 | 127 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2018-19 | SEP | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | D | 1996-11-11 | 6.0 | BADAJOZ | 740.0 | VILLAFRANCA DE LOS BARROS |
| 5421 | 128 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2019-20 | JUL | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1996-02-13 | 10.0 | CÁCERES | 1.0 | CÁCERES |
| 5422 | 129 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2019-20 | SEP | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1992-05-10 | 6.0 | BADAJOZ | 135.0 | CAMPANARIO |
| 5423 | 130 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2020-21 | NOV | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | D | 1997-04-11 | 10.0 | CÁCERES | 700.0 | SIERRA DE FUENTES |
| 5424 | 131 | 1639 | MÁSTER UNIV. EN METODOLOGÍA BIM EN EL DESARROLLO COLABORATIVO DE PROYECTOS | 2020-21 | N | N | NaN | N | 2020-21 | NOV | 5 | Título Universitario | 1 | Titulado universitario | NaN | NaN | NaN | H | 1992-06-13 | 6.0 | BADAJOZ | 265.0 | FUENTE DEL MAESTRE |